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Abstract.  ‘'Peaks  over  Threshold"^^P0fFu‘)'  models  commonly  used  e.g.  in 

hydrology,  assume  that  peak  values  of  an  iid  or  stationary  sequence  above  a 

high  value  u,  occur  at  Poisson  points,  and  the  excess  values  of  the  peak  above 

u  are  independent  with  an  arbitrary  common  d.f.  G.  Motivation  for  these  models 

/ - >  f - - 

has  been  provided  by  Rrfcf  Smith  -fefv  ■  [8]  by  using  Pare  to- type 

approximations  of  Pickandsf^-f®??  for  distributions  of  such  excess  values. 

These  works  strongly  suggest  that  the  Pareto  family  provides  the  appropriate 
class  of  distributions  G  for  the  POT  model. 

Tn  the  present  paper**,  cons ideri the  point  process  of  excess  values  of 

r\ 

peaks  above  a  high  level  u  and  demonstrate  that  this  converges  in  distribution 

\cxp^ackcs  Lm?  of _ .. 

to  a  Compound  Poisson  Process  as  ir  "  J  Sunder  ‘appropriate" assumptions .  It  is 
shown  that  the  multiplicity  distribution  of  this  limit  (i.e.  the  limiting 
distribution  of  excess  values  of  peaks)  must  belong  to  the  Pareto  family  and 
detailed  forms  are  given  for  the  normalizing  constants  involved.  This  exhibits 
the  POT  model  specifically  as  a  limit  for  the  point  process  of  excesses  of 
peaks  and  delineates  the  distributions  involved. 
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1.  Introduction. 


In  what  are  sometimes  called  "Peaks  over  Threshold"  (POT)  models  (cf. 

[7]).  the  excess  values  over  a  high  level  u  by  an  observed  time  series  are 

assumed  to  occur  at  Poisson  points  and  to  have  arbitrary  common  distributions. 

That  is  if  (X^:  1=1,2,...}  is  e.g.  a  stationary  (or  iid)  sequence,  the 

exceedance  points  (i:  >  u}  are  assumed  to  be  Poisson  and  the  corresponding 

excess  values  (Xi-u)+  to  be  independent  with  an  arbitrary  distribution. 

The  Poisson  nature  of  the  occurrence  of  exceedances  is  intuitively  clear 

since  e.g.  if  X^  are  iid  with  d.f.  F,  the  number  of  exceedances  of  a  high 

level  u  by  X, . X  ,  is  binomial  with  parameters  (n,  1-F(u  ))  and  hence 

n  ln  v  n 

approximately  Poisson  if  n(l-F(un))  converges  to  some  value  t  >  0.  This  will 
be  made  more  precise  below  by  a  time  normalization.  Further  motivation  for  the 
model  is  provided  by  R.L.  Smith  ([7], [8])  based  on  theory  of  Pickands  [6], 
restricting  the  distribution  for  excess  values  to  a  "generalized  Pareto”  (GP) 
form 

(1.1)  Ga  ^(x)  =  1  -  (1  +  ax/p)~l/a  p  >  0,  a  *  0 

=  1  -  e_x/P  p  >  0.  a  =  0 

where  the  range  of  x  is  (0,®)  if  a>0  and  (0,-a  *0)  if  a<0. 

This  class  is  a  flexible  2-parameter  family,  but  more 
wide  class  of  F's  the  excess  distribution 


importantly  for  a 


(1.2)  Fu(x)  =  P(X  -  u  *  x|X  >  u} 

is  approximately  GP  in  a  sense  shown  by  Pickands  [6],  viz. 


(1.3)  inf  sup  |F  (x)  -  G  (x)  |  — >  0  as  u  — »  » 

P  x  ,p 
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for  some  fixed  a.  That  is  for  high  levels  u.  (3  may  be  chosen  (depending  on  u) 
so  that  ^(x)  is  uniformly  close  to  Fu(x). 

Write  Xp  for  the  right  endpoint  (sup{x:  F(x)  <  1}  of  the  d.f.  F  and 
F(x)  =  l-F(x),  the  tail  of  F.  Then  the  class  of  d.f.’s  F  for  which  the  above 
GP  approximation  holds  includes  all  those  satisfying 

(1.4)  F(u+xg(u))/F(u)  — ►  G(x)  as  u  — »  Xp 

for  some  function  g(u)  >  0,  some  d.f.  G  and  al 1  0  <  x  <  Xp  (£  °°) .  It  is  known 

([6])  that  any  such  G  in  (1.4)  must  be  G.P.  as  in  (1.1)  for  some  a.p. 

Note  that  (1.4)  holds  for  all  d.f.'s  F  of  interest  in  extreme  value  theory 

i.e.  such  that  if  M  =  max(X, . X  ),  P{a  (M  -b  )  $  x)  (=  F^x/a  +b  ))  has  a 

n  v  1  n'  1  nv  n  n7  J  v  v  n  n" 

non  degenerate  limit  A(x).  For  example  if  G(x)  =  e  .  (1.1)  is  a  classical 
domain  of  attraction  criterion  for  a  "Type  I"  extreme  value  distribution 
A(x)  =  exp(-e  X) .  If  instead  F  has  a  regulary  varying  tail  (l-F(ux) )/(l-F(u) ) 
— »  x-0  .  a>0,  as  u  — »  «,  each  x  >  0.  then  (1.1)  holds  with  G(x)  =  (l+x)-*1. 
g(u)  =  u,  and  A(x)  is  then  "Type  II"  i.e.  A(x)  =  expt-x"^1).  x  >  0. 

Hence  in  a  wide  variety  of  cases  of  interest  the  distribution  Fu(x)  of 
excesses  (given  by  (1.2))  of  the  level  u  is  approximately  GP,  G  „(x)  in  the 
sense  stated,  where  a  is  fixed  but  |3  can  change  with  u.  As  discussed  in  [7] 
this  provides  significant  intuitive  support  for  the  POT  model.  Our  purpose 
here  is  to  further  justify  the  model  by  exhibiting  it  as  the  limit  of  point 
processes  of  excesses  of  high  levels,  the  limiting  distribution  of  excess 
values  being  shown  to  be  GP,  G  „(x)  where  now  (3  as  well  as  a,  is  independent 

(X  f  p 

of  u. 

For  clarity  this  will  be  shown  for  iid  sequences  in  Section  2  and  extended 
to  dependent  (mixing)  situations  in  Section  3.  In  the  latter  case  high  serial 
correlation  can  cause  clustering  of  exceedances  of  high  levels  and  the  peak 


3 


values  are  then  defined  to  be  the  largest  in  each  cluster.  Two  points  worth 
noting  are  (i)  the  dependence  modifies  the  theory  via  the  introduction  of  a 
single  parameter,  the  "extremal  index"  (essentially  the  inverse  of  mean  cluster 
size)  and  (ii)  the  GP  form  applies  to  the  peak  values  but  not  necessarily  to 
other  cluster  properties  such  as  their  lengths.  A  corresponding  theory  will  be 
indicated  in  Section  3  for  other  such  cases. 

2.  The  iid  case. 

Let  (un)  be  a  sequence  of  levels  such  that 
(2.1)  n(l-F(un))  —  r  >  0. 

Define  point  processes  Nn  to  consist  of  the  points  i/n  for  which  Xj  >  u^,  i.e. 
the  exceedance  points  normalized  by  1/n.  Nn  is  thus  defined  on  the  positive 
real  line  but  it  will  be  convenient  to  restrict  attention  to  the  unit  interval, 

corresponding  to  exceedances  among  the  n  sample  values  X j . Xr.  Hence  Nn(B) 

is  defined  for  (Borel)  subsets  B  C  (0,1]  by  Nn(B)  =  #{i/n  e  B:  X{  >  un) . 

It  is  trivial  to  show  that  under  (2.1)  that  ^  N  where  N  is  a  Poisson 
Process  on  (0,1]  with  intensity  r.  For  if  I  =  (a.b]  C  (0.1],  N^fl)  is  binomial 
with  parameters  [nb]-[na],  pn=l-F(un)  ([  ]  denoting  integer  part)  and  npn  — »  r 
so  that 

P{Nn(I)  =  r)  —  e‘T(b~a)[T(b-a)]r/r!  =  P(N(I)  =  r). 

Hence  Nn(I)  N(I)  and  by  independence  if  1^ . 1^  are  disjoint 

(Nn{Il)*-  -W)  ^  (Nflj). . -N(Ik)) 
which  is  sufficient  ([2])  to  show  full  weak  convergence  N^  N. 

Now  associate  with  each  point  of  N  (i/n  such  that  X.  >  u  )  the 

n  v  in7 

corresponding  excess  value  X^-u^  to  give  the  "point  process  of  excesses". 
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Technically  perhaps  Nr  should  be  regarded  as  a  "marked  point  process"  (or  as  an 

atomic  random  measure)  since  the  (X^-u^)  are  not  necessarily  integer  valued  but 

it  will  be  convenient  (and  legitimate)  to  call  it  a  point  process  whose  events 

(at  (i/n:  >  un))  have  (not  necessarily  integer)  multiplicities  (X^-u^. 

Since,  as  above,  the  positions  of  the  excesses  converge  to  a  Poisson 

Process,  it  seems  evident  that  one  should  expect  any  limit  N  for  Nn  to  be  a 

compound  Poisson  process  having  events  at  Poisson  points  with  intensity  t  and 

(independent)  multiplicities  with  some  common  d.f.  G.  Such  a  result  may  be 

shown  (cf.  [1])  but  here  we  give  sufficient  conditions  for  such  convergence. 

* 

Here  and  below  we  write  N  =  CP(t,G)  to  denote  a  Compound  Poisson  point  process 
whose  Poisson  events  have  intensity  r  with  (not  necessarily  integer-valued) 
multiplicities  having  d.f.  G. 

Theorem  2.1.  Let  X^,  1=1.2...  be  lid  with  d.f.  F  satisfying  (1.4)  for  some  g. 
G.  and  let  (ur)  be  levels  satisfying  (2.1).  Then  anN*  N*.  CP(t,G)  where  r 
is  as  in  (2.1),  G  as  in  (1.4)  and  a^  =  l/gfu^) . 

Proof :  If  G  (x)  (=  F  (x/a  ))  denotes  the  conditional  d.f.  of  a  (X,-u  )  given 

n'  '  '  u  '  n"  n  1  n 

n 

Xj  >  un>  then  clearly 

«exp{-san(X1-un)+)  =  F(un)  +  F(un)  J*”e_sxdGn(x) . 

But  from  (1.4).  Gn(x)  — ►  G(x)  and  hence  J*q  e  sxdGn(x)  — *  $(s)  =  f^e  sxdG(x)  so 
that  if  I  =  (a.b]  C  (0,1]  contains  m^  points  i/n  (mn  ~  n(b-a)) 

S  nexp{-san(Xrun)+}  =  [l-(r/n)(l-^(s))(l+o(l))]mn 

_»  e-T(b-a)(l-<Ks)) 

which  is  the  Laplace  Transform  Se  where  N*  is  CP(t,G).  Since  N*(I)  is 


5 


the  Siam  2  (X.-u  ),  of  m  iid  terms,  it  follows  that  a  N  (I)  — »N  (I).  Hence 

,  ,  i  n'  +  n  n  nv  ' 

i/n€I 

by  independence 


(a  N  (IJ.-.a  N  (I  ))  ^  (N  (I,) _ N  (I.  )) 

v  n  nv  n  nv  n"  v  nv  V  nv  k" 


for  any  disjoint  1^ . 1^  so  that  a^N*  N*  as  required. 


□ 


Thus  in  the  iid  case  the  (normalized)  point  process  anNn  of  excesses  over 
ur  has  a  Compound  Poisson  limit  and  may  be  regarded  as  approximately  CP(t,G) 
for  large  n.  Note  again  that  G  is  a  GP  distribution  G^  ^  for  some  fixed  a, {3. 
This  provides  a  strong  basis  for  the  POT  model  in  the  iid  case. 


3.  Stationary  sequences 

A  stationary  sequence  Xj.Xg.--.  can  exhibit  both  "long  range”  and  "local" 

dependence  between  the  X^ .  Here  the  former,  but  not  the  latter,  will  be 

restricted  by  a  "mixing  condition"  which  enables  the  collection  of  X.’s  into 

groups  which  are  approximately  independent  but  with  possible  high  dependence 

within  groups.  Strong  mixing  will  suffice  to  restrict  long  range  dependence. 

However  one  may  "tailor"  this  to  the  problem  at  hand  with  a  slightly  weaker 

restriction  A(v^)  defined  for  any  sequence  of  constants  vr  as  follows.  For 

llj£k£n  write  38 yn)  =  CT{(^s-vn)+:  and  say  that  A(v^)  holds  if  for 

ll«<n  | P( AfTB)  -  P(A)P(B)|  i  g  whenever  A  C  Sj  j(vn)*  B  €  n^Vn^' 

and  a  „  — *  0  for  some  8  =  o(n). 

n,«  n  v  ’ 

n 

High  local  dependence  is  reflected  in  the  presence  of  clustering  of 

exceedances  of  high  levels.  "Clusters"  will  be  defined  precisely  below,  but  we 

first  note  that  for  levels  (u^)  satisfying  (2.1)  the  mean  size  of  a  cluster 

typically  converges  to  a  parameter  whose  inverse  0  is  sometimes  called  the 

"extremal  index"  of  the  sequence  {X^}.  Specifically  0  is  defined  by  the 

property  that  if  M  =  max(X. ,X„ . X  )  and  u  satisfies  (2.1)  then 

n  i  z  n  n 
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(3.1)  P(Mn  i  un)  . 

0  being  independent  of  r.  (For  i.i.d.  sequences  0=1).  Such  0  exists  under 
general  conditions  (cf  [4],  Sec.  3.7) 

Clusters  of  exceedances  of  high  levels  may  be  defined  in  various  ways,  the 
most  obvious  being  as  runs  of  consecutive  exceedances.  However  if  A(vn)  holds 
for  a  given  sequence  (v^)  of  levels  the  following  "block  definition"  is  more 
convenient  (and  often  asymptotically  equivalent  to  that  using  runs).  Choose 
integers  kR  — ►  «,  k^  =  o(n) ,  satisfying 

(3.2)  k  (a  .  +  £  /n)  -+  0. 

v  ’  nv  n.B  n  7 

n 

Write  rn=[n/kn]  and  divide  the  integers  (l...n)  into  consecutive  blocks 
Jj  =  {(i-l)rn+l.  (i-l)rnt2 . lrn> 

4  *1  *  <Vn+1>  Vn*2 . "> 

n 

and  regard  the  exceedances  (if  any)  in  a  block  as  forming  a  cluster.  The 
choice  of  kn  (or  equivalently  r  )  is  flexible,  subject  to  the  growth 
restriction  (3.2). 

Obviously  this  block  definition  can  count  a  run  of  consecutive  exceedances  as 

two  or  more  clusters,  if  the  run  straddles  more  than  one  block  so  that  the 

block  definition  is  less  natural  in  some  cases.  However  it  can  be  more 

appropriate  than  the  runs  definition  for  sequences  with  high  local  variability. 

In  any  case  we  use  the  block  definition  for  its  convenience.  Further  if  the 

block  J.  contains  exceedances  the  first  point,  (i-l)r  +1,  of  J.  will  be 
i  n  x 

regarded  as  the  location  of  the  cluster  in  J^. 

Define  now  a  point  process  Pn  of  normalized  cluster  positions,  i.e. 
consisting  of  the  points  (((i-l)rn+l)/n,  1  £  i  £  k^:  M(J^)  >  u^}  where  M(E)  is 
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written  for  max(Xj:  jeE).  Associate  with  each  point  ((i-l)rn+l)/n  of  Pn  the 
corresponding  maximum  excess  M(J^)-un  =  max{(Xj-un)+:  j  e  J^},  giving  the 
(again  technically  "marked")  point  process  of  peak  excess  values  above  the 
level  in  the  clusters.  We  show  that  a  result  like  Theorem  2.1  holds  in  the 
dependent  case  with  Pr  replacing  N  .  In  fact  it  may  be  shown  (cf  [5])  that  for 
i.i.d.  sequences  Nn  and  Pr  are  asymptotically  equivalent  in  a  strong  sense  as 

&  &  H 

also  are  N  and  P  so  that  P  and  P  generalize  N  and  N  . 

n  n  n  n  n  n 

It  was  shown  in  Section  1  that  in  the  iid  case  the  exceedance  point 

process  has  a  Poisson  limit  with  intensity  t.  This  result  is  simply 

generalized  (cf.  [1])  to  show  that  under  dependence  Pr  has  a  Poisson  limit 

(with  intensity  0r)  whereas  Nn  itself  has  a  Compound  Poisson  limit  whose  events 

occur  a*-  the  positions  of  clusters  (i.e.  points  of  P  ),  with  multiplicities 

given  by  (limiting)  cluster  sizes.  However  the  (limiting)  distribution  for 

multiplicities  need  not  have  a  GP  form  and  need  not  be  totally  determined  by 

the  (tail  of  the)  marginal  d.f.  F  of  the  XA .  On  the  other  hand  it  will  be 

shown  below  that  P  has  a  CP(0t,G)  distributional  limit  where  G  is  obtained 
n 

from  the  tail  of  F  via  (1.4).  Thus  G  has  GP  form  and  the  dependence  influences 
the  limit  only  through  the  factor  0  in  the  intensity  of  the  underlying  Poisson 
Process. 

The  Compound  Poisson  limit  for  Pr  will  be  obtained  by  considering  the 
asymptotic  behavior  of  the  maximum  in  a  cluster  and  showing  that  the  clusters 
are  essentially  independent.  The  specific  basic  results  needed  are  contained 
in  the  following  lemma 

Lemma  3.1.  Let  {X^}  be  stationary  with  extremal  index  0  >  0  and  marginal  d.f. 

F  satisfying  (1.4),  and  let  A(v  )  hold  with  v  =u  +xg(u  ),  each  x  >  0,  where  u 

satisfies  (2.1).  Let  {k^}  satisfy  (3.2)  and  write  a^  =  (gfu^))  *.  Then  (with 

r  =  [n/k  ]),  as  n  — ► 
n  L  nJ/ 


00 
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(i)  P{an(Mr  -un)  >  x}  ~  g^G(x).  x  l  0 

n  n 

(ii)  P{an(Mr  -un)  *  x|Mr  >un}  *  G(x) 

n  n 

k 

(iii)  [*exp{-san(Mr  -un)+}]  n  — »  exp{-0r(l-«(s))} 

n 

where  f(s)  =  J*q  e  sxdG(x) 

Proof:  Since  n[l-F(un+an^x)]  ~  rF(un+xg(un))/F(un)  — >  tG(x)  and  {X^}  has 
extremal  index  0,  it  follows  that 

p(Mn  £  un+anlx^  “ *  exp(-0rG(x)) . 

But  it  follows  in  a  standard  way  from  the  mixing  condition  A(vn)  (cf.  [1.  Lemma 

k  k 

2.3])  that  P{M  Sv  }  -  P  "{M  *v  }  — »  0  so  that  P  n{M  +a_1x)  — »  exp(-0TC(x)) 
■*  nn  rn  rnn 

n  n 

from  which  (i)  readily  follows.  The  left  hand  side  of  (ii)  is 

1  -  P{Mr  >un+a;1x}/P{Mr  >un)  =  1  -  |lG(x)/^  (l+o(l)) 
n  n  n  n 

by  (i),  so  that  (ii)  follows.  Finally  (iii)  follows  by  the  first  calculation 

of  Theorem  2.1  with  M  replacing  Xj,  using  (ii).  0 

n 

The  main  result  now  follows  in  a  similar  way  to  Theorem  2.1  on  using 
approximate  independence  between  the  clusters. 

Theorem  3.2.  Let  (X  1  be  stationary  with  extremal  index  0  >  0.  and  marginal 

-  V 

d.f  F  satisfying  (1.4),  and  let  A(vn)  hold  with  vn  =  un  +  xg(un)  each  x  >  0, 

where  u  satisfies  (2.1).  Let  k  — »  “>  satisfy  (3.2),  r  =  [n/k  ]  and 
n  n  n  n 

'“1  m 

a^  =  (gfu^))  .  Then  the  point  process  P^  of  peak  values  above  satisfies 

a^P*  P*  where  P*  is  CP(0t,G)  and  G  is  as  in  (1.4). 
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Proof:  Let  I  be  a  subinterval  of  (0,1]  and  J.  =  {(i-l)r  +1 . ir  }  as  defined 

above,  1  <  i  <  k  .  Then  it  follows  from  the  mixing  conditions  along  the  same 
n 

lines  as  Lemma  2.2  of  [3]  (or  Lemma  2.2  of  [1])  that 

«exp{-sa  P*(I)}  -  n  «exp{-sa  P*(J  )}  — ►  0 
n  n  (i^CI) 


as  n  — ►  00 ,  so  that 

*  k m(I)(l+o(l)) 

5exp(-sanPn(I)}  =  [Sexp{-san(Mr  -un)+}] 

n 

— ♦  exp{-0Tm(I)(l-^(s))} 

by  Lemma  3.1,  where  $(s)  =  J*q  e  SXdG(x).  Hence  a^P^fl)  P*(I)  where  P*  is 

CP(0t,G)  on  (0.1].  Now  if  Ij . Ik  are  disjoint  subintervals  of  (0,1]  it 

follows  also  as  in  Lemma  2.2  of  [3]  (or  Lemma  2.2  of  [1])  that 

Jexpf^V  Jf1SjPn(Ij,>  '  J|[1{eXp('anPn(Ij»  “*  0 
from  which  it  follows  that 


(Ik» 


and  hence  that  a  P*  P**. 

n  n 


0 


Finally  we  reiterate  that  this  result  is  one  of  many  which  can  be  obtained 

involving  different  aspects  of  cluster  structure.  For  example  the  Compound 

Poisson  limit  for  N  was  cited  above,  the  multiplicities  corresponding  to 

n 

cluster  sizes.  More  complicated  functions  -  such  as  the  sum  of  powers  of 
excess  values  in  a  cluster  -  may  also  be  considered  and  will  lead  to  Compound 
Poisson  limits.  Such  cases  may  be  useful  in  applications  where  damage  from 
high  levels  (e.g.  high  pollution  episodes)  may  be  modeled  as  a  specific 
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function  of  the  excess  values.  However  the  case  of  (excess)  peak  values  in  a 
cluster  is  especially  important  (e.g.  in  describing  severe  floods,  where  damage 
can  well  be  a  function  of  flood  level).  Theorem  3.2  establishing  the  POT 
approximation  is  particularly  useful  since  the  multiplicity  distribution  then 
depends  only  on  the  marginal  d.f.  F  and  moreover  is  known  to  have  GP  form. 
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